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Abstract. We prove in this paper that there exists some infinitary ra- 
tional relations which are analytic but non Borel sets, giving an answer 
to a question of Simonnet [Sim92]. 
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1. Introduction 

Acceptance of infinite words by finite automata was firstly considered by Biichi in 
order to study decidability of the monadic second order theory of one successor 
over the integers [Biic62] . Then the so called w-regular languages have been inten- 
sively studied and many applications have been found, see [Tho90] [Sta97] [PPOl] 
for many results and references. 

Rational relations on finite words were studied in the sixties and played a fun- 
damental role in the study of families of context free languages [Ber79]. Their 
extension to rational relations on infinite words was firstly investigated by Gire 
and Nivat [GirSl] [GN84]. Infinitary rational relations are subsets of x Ej, 
where Ei and E2 are finite alphabets, which are recognized by Biichi transducers 
or by 2-tape finite Biichi automata with asynchronous reading heads (there exists 
an extension to subsets of E" x Ej x . . . x E!;^ recognized by n-tape Biichi automata, 
with El, . . . , E„ some finite alphabets, but we shall not need to consider it). Since 
then they have been much studied, in particular in connection with the rational 
functions they may define, see for example [CG99] [BCOO] [Sim92] [Sta97] [PriOO] 
for many results and references. 
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The question of the complexity of such relations on infinite words naturally arises. 
A way to investigate the complexity of infinitary rational relations is to consider 
their topological complexity and particularly to locate them with regard to the 
Borcl and the projective hierarchies. It is well known that every w-language ac- 
cepted by a Turing machine with a Biichi or MuUer acceptance condition is an 
analytic set, [Sta97], thus every infinitary rational relation is an analytic set. Si- 
monnet asked in [Sim92] whether there exists some infinitary rational relation 
which is an analytic but non Borel set. We give in this paper a positive answer to 
this question showing that there exists some non Borel (and even Sj-complete) 
infinitary rational relation. The paper is organized as follows. In section 2 we 
introduce the notion of transducers and of infinitary rational relations. In section 
3 we recall definitions of Borel and analytic sets, and we prove our main result in 
section 4. 



2. Infinitary rational relations 

Let S be a finite alphabet whose elements are called letters. A non-empty finite 
word over E is a finite sequence of letters: x = aia2 . . . a„ where V-i S [1; n] ai £ S. 
We shall denote x{i) = Ui the i*^ letter of x and x[i] = x{l) . . . x{i) for i < n. The 
length of X is |a;| = n. The empty word will be denoted by A and has letter. Its 
length is 0. The set of finite words over E is denoted S*. E"*" = E* — {A} is the 
set of non empty words over E. A (finitary) language L over E is a subset of E*. 
The usual concatenation product of u and v will be denoted by u.v or just uv. For 

V C E*, we denote V* = {vi . . . Vn/n e N and Vi€V Vi € [1; n]}. 

The first infinite ordinal is w. An w-word over S is an w -sequence aia2 . . . a„ . . ., 
where G E, V« > 1. When a is an w-word over E, we write a = fT(l)a(2) . . . a{n) . . 
and a[n] = a{l)a{2) . . . <7{n) the finite word of length n, prefix of a. The set of 
w- words ovcir the alphabet E is denoted by E'^. An w-language over an alphabet 
E is a subset of E". For F C E*, F'^ = {ct = ui ...«„... e E'^/uj e V, Vi > 1} 
is the w-power of V. The concatenation product is extended to the product of a 
finite word u and an a>-word v: the infinite word u.v is then the w-word such that: 
{u.v){k) = u(k) iik < \u\ , and {u.v){k) = v(k — \u\) if fc > |w|. 
The prefix relation is denoted C: the finite word w is a prefix of the finite word 

V (respectively, the infinite word w), denoted u Q v, if and only if there exists a 
finite word w (respectively, an infinite word w), such that v = u.w. 

We assume the reader to be familiar with the theory of formal languages and of 
w-regular languages, see [Biic62] [Tho90] [EH93] [Sta97] [PPOl] for many results 
and references. We recall that w-regular languages form the class of w-languages 
accepted by finite automata with a Biichi acceptance condition and this class is 
the omega Kleene closure of the class of regular finitary languages. 
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Wc arc going now to introduce the notion of infinitary rational relation which 
extends the notion of w-regular language, via definition by Biichi transducers; 

Definition 2.1. A Biichi transducer is a sextuple T = {K, E, T, A, qo, F), where 
K is & finite set of states, S and T are finite sets called the input and the output 

alphabets, A is a finite subset oi K xY,* xT* x K called the set of transitions, qo 

is the initial state, and F C K is the set of accepting states. 

A computation C of the transducer T is an infinite sequence of transitions 

{qo,ui,Vi,qi), {qi,U2,V2,q2), ■ ■ ■ {qi-i,Ui,Vi,qi), {qi,Ui+i,Vi+i,qi+i), . . . 

The computation is said to be successful iff there exists a final state qf € F and 
infinitely many integers i >0 such that qi = qf. 
The input word of the computation is u = U1.U2.U3 . . . 
The output word of the computation is v = V1.v2.v3 . . . 
Then the input and the output words may be finite or infinite. 
The infinitary rational relation R{T) C T," x recognized by the Biichi trans- 
ducer T is the set of couples {u, v) G x such that u and v are the input and 
the output words of some successful computation C of T. 
The set of infinitary rational relations will be denoted RAT. 

Remark 2.2. An infinitary rational relation is a subset of x T" for two finite 

alphabets E and F. One can also consider that it is an a;-language over the finite 
alphabet E x F. If (m, v) e E'^ x F'^, one can consider this couple of infinite words 
as a single infinite word {u{l),v{l)).{u{2),v{2)).{u{3),v{3)) . . . over the alphabet 
E x F. We shall use this fact to investigate the topological complexity of infinitary 
rational relations. 

3. BOREL AND ANALYTIC SETS 

We assume the reader to be familiar with basic notions of topology which may be 
found in [Kur66] [Mos80] [Kec95] [LT94] [Sta97] [PPOl]. 

For a finite alphabet X having at least two letters we shall consider A" as a 
topological space with the Cantor topology. The open sets of X" are the sets in 
the form W.X^, where W C X*. A set i C X'^ is a closed set iff its complement 
X" — L is an open set. We define now the next classes of the Borel Hierarchy: 

Definition 3.1. The classes I]° and n° of the Borel Hierarchy on the topological 
space X^ are defined as follows: 

is the class of open sets of X'^. 
11° is the class of closed sets of X'^. 
And for any integer n > 1: 

^n+i is the class of countable unions of n°-subsets of X". 

^n+i is the class of countable intersections of S°-subsets of A". 

The Borel Hierarchy is also defined for transfinite levels. The classes and n^, 

for a countable ordinal a, are defined in the following way: 
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S° is the class of countable unions of subsets of X" in U-y<an°. 

n° is the class of countable intersections of subsets of X'^ in U-y<aS°. 

There are also some subsets of X'^ which arc not Borel sets. In particular the class 
of Borel subsets of X'^ is strictly included into the class S]; of analytic subsets 
of X'^ . A subset A of X" is an analytic set iff there exists another finite set 
Y and a Borel subset B oi {X x YY such that x e A 'r^ 3y e Y'^ such that 
(x, y) G B, where [x, y) is the infinite word over the alphabet X xY such that 
{x,y){i) = {x{i),y{i)) for each integer i>l. 

Recall also the notion of completeness with regard to reduction by continuous 
functions. If a be a countable ordinal, a set F C X" is said to bo a S° (respectively 
n°, Si)-completc set iff for any set E CY'^ (with Y a finite alphabet): G S° 
(respectively E G 11°, Sj) iff there exists a continuous function / : X'^ 
such that E = /"^(F). 

A I]° (respectively 11°, Sj;)-complete set is a S° (respectively 11°, Sj)- set 
which is in some sense a set of the highest topological complexity among the S° 
(respectively 11° , S^)- sets. S° (respectively n°)-complete sets, with n an integer 
> 1, are thoroughly characterized in [Sta86]. 

The w-language A = (0*.l)'^ is a well known example of n°-complete set which 
will be used below. It is the set of w-words over the alphabet {0, 1} with infinitely 
many occurrences of the letter 1. 

4. Si-COMPLETE INFINITARY RATIONAL RELATIONS 
We can now state our main result: 

Theorem 4.1. There exists some T,\-complete (hence non Borel) infinitary ra- 
tional relations. 

Proof. We shall use here results about languages of infinite binary trees whose 
nodes are labelled in a finite alphabet S. 

A node of an infinite binary tree is represented by a finite word over the alphabet 
{l,r} where r means "right" and I means "left". Then an infinite binary tree 
whose nodes are labelled in S is identified with a function t : {I, r}* S. The set 
of infinite binary trees labelled in S will be denoted T^. 

There is a natural topology on this set [Mos80], [LT94], [Sim92]. It is defined 
by the following distance. Let t and s bo two distinct infinite trees in . Then the 
distance between t and s is ^ where n is the smallest integer such that t{x) ^ s{x) 
for some word x G {I, r}* of length n. 

The open sets are then in the form Tq.T^ where Tq is a set of finite labelled trees. 
Tq.T^ is the set of infinite binary trees which extend some finite labelled binary 
tree to &To, to is here a sort of prefix, an "initial subtree" of a tree in to.T^- 
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For an alphabet E having at least two letters the topological space is liomeo- 
morphic to the Cantor set S'^ . Borel and analytic subsets of are defined from 
open sets in the same manner as in the case of the topological space E'^. 

Let t be a tree. A branch S of t is a subset of the set of nodes of t which is linearly 
ordered by the tree partial order C and which is closed under prefix relation, i.e. 
if X and y are nodes of t such that y G B and x 'Qy then x G B. 
A branch S of a tree is said to be maximal iff there is not any other branch of t 
which strictly contains B. 

Let t be an infinite binary tree in . If S is a maximal branch of t, then this 
branch is infinite. Let (ui)i>o be the enumeration of the nodes in B which is 

strictly increasing for the prefix order. 

The infinite sequence of labels of the nodes of such a maximal branch B, i.e. 
t{uo)t{u\)....t{un) is called a path. It is an w-word over the alphabet S. 

Let then L C S'^ be an w-language over S. We denote Path{L) the set of infinite 
trees t in such that t has at least one path in L. 

It is well known that if L C S'^ is an w-language over S which is a Ila-complete 
subset of (or a Borel set of higher complexity in the Borel hierarchy) then the 
set Path{L) is a S];-complete subset of T^. Hence in particular Path(L) is not a 
Borel set, [Niw85] [Sim93] [Sim92]. 

Whenever i? C S'^ is a regular tj-language. we shall find a rational relation 
i? C (S U {A}Y X (S U {A}Y and a continuous function 

: ^ ((S U {A}) X (S U {A}))- 
such that Path{B) = h~^{R). For that we shall code trees labelled in E by words 
over the finite alphabet (E U {A}) x (E U {A}) where A is supposed to be a new 
letter not in E. 

Consider now the set {I, r}* of nodes of binary infinite trees. For each integer 
n> 0, call C„ the set of words of length n of {I, r}*. Then Co = {A}, Ci = {I, r}, 
C2 = {II, Ir, rl, rr} and so on. C„ is the set of nodes which appear in the (n + 1)*'^ 
level of an infinite binary tree. The number of nodes of C„ is card{Cn) = 2". We 
consider now the lexicographic order on C„ (assuming that I is before r for this 
order). Then, in the enumeration of the nodes with regard to this order, the nodes 
of Ci will be: I, r; the nodes of C3 will be: III, llr, Irl, Irr, rll, rlr, rrl, rrr. 
Let u", w", be such an enumeration of C„ in the lexicographic order and 
let Vi , ...jVj", be the enumeration of the elements of C„ in the reverse order. 
Then for all integers n > and i, 1 <i < 2", it holds that w" = «2"+i-i- 

We define now the code of a tree t in T^. Let A be a new letter not in S. The code 
of the tree t is an w-word a over the alphabet (E U {A}) x (E U {A}) which may 
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be written in the form (0-1,(72), where ai and a2 are w-words over the alphabet 
(EU{A}). 

The a;-word (Ti enumerates the labels of the nodes of tlw^ trcx^ t which appear at 
levels 1, 3, 5, . . . , 2n+ 1, . . . , i.e. at odd levels. More precisely the word cti begins 
with the label t{vi) of the node at level 1, followed by an A, followed by the labels of 
the nodes of the third level enumerated in the reverse lexicographic order, i.e. 
t{vl)t{v2)t{vl)t{vl) , followed by an A, followed by the labels of the nodes of the 5*'* 
level enumerated in the reverse lexicographic order, i.e. t{vf)t{v2)t{v^) . . .t(wfg), 
and so on . . . 

For each integer n > 0, the labels of the nodes of C2n, enumerated in the reverse 
lexicographic order, are placed before those of C2n+2 and these two sets of 
labels are separated by an A. 

The construction of the w-word C72 is very similar but it successively enumerates, 
in the lexicographic order, the labels of nodes occuring at even levels. So the 
word a2 is in the form 

(72 = t{ul)t{ul)At{ul)t{ul)t{ul)t{ul)t{ul)t{ul)t{u^)t{ul)A.. . 

For each integer n > 0, the labels of the nodes of C2„+i are enumerated before 
those of C2n+3 and these two sets of labels are separated by an A. Moreover the 
labels of the nodes of C2n+i, for n > 0, are enumerated in the lexicographic 
order (for the nodes). 

Let then h be the mapping from into {{EU{A}) x (i;u{A}))" such that for every 
labelled binary infinite tree t of , h{t) is the code (ai, 0-2) of the tree as defined 
above. It is easy to see, from the definition of h and of the order of the enumeration 
of labels of nodes (they are enumerated level after level in the increasing order), 
that his a continuous function from into ((S U {A}) x (S U {A}))". 

Now we are looking for a rational relation R such that for every tree t & T^, 
h{t) e i? if and only if t has a path in B. Then we shall have Path{B) = h~^{R). 

We shall first describe the rational relation R which is an w-language over the 
alphabet ((S U {A}) x (E U {^})). Every word of R may be seen as a couple 
y = (j/i, 2/2) of a)-words over the alphabet S U {A}. Now y = (2/1,1/2) is in R if and 
only if it is in the form 

2/1 = x(l).ui.A.V2-x(i).u^.A.Vi.x{b).U5.A. . . . A.V2n-x{2n + l).U2n+i.^ . . . 

2/2 = Vi.x{2).U2-A.V3.x{4).U4.A A.V2n+l-X{2n + 2).U2n+2-A . . . 

where for all integers i>l, x{i) e S and Ui,Vi G S* and 

|wj|=2|ui| or \vi\ = 2\ui\ + 1 
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and the w-word x = x{l)x{2) . . . x{n) ... is in B. 

If such an w-word y = (2/1,1/2) is the code h{t) of a tree t e T^, then a;(l) = t{vi) 
and Ml = A, then |fi| = 2|ui| = or = 2|ui| + 1 = 1. Therefore if |wi| = 
then x{2) = t{ul) and if |ui| = 1 then x{2) = t{ul). Then the choice of |vi| = 2|ui| 
or of \vi I = 2|ui| + 1 implies that x{2) is the label of the left or the rigth successor 
of the root node Vi = A. 

By construction this phenomenon will happen for further levels. The choice of 

\vi\ = 2\ui\ or of \vi\ = 2\ui \ + 1 determines one of the two successor nodes of a 
node at level i thus the successive choices determine a branch of t and the labels 
of nodes of this branch form a path x{l)x{2)x{3) . . . x{n) . . . which is in B. Thus 
for a tree t G T^, h{t) e R if and only if t G Path{B) then Path{B) = h-'^{R). 
Remark that R does not contain only codes of trees but such a code h{t) is in R 
iff f G Path{B) and this fact suffices for our proof. 

Hence if B is a Borel set which is a Ila-complete subset of (or a set of higher 
complexity in the Borel hierarchy), the language h~^{R) = Path{B) is a S^- 
complcte subset of T^. Then the w-language R is at least Sj-complete because 
ft, is a continuous function. 

Note that here /i is a continuous function: ((SU{A}) x {Y.VS{A})Y and the 

preceding definition of S];-complcte set involves continuous reductions: X'^ y"; 
but the two topological spaces and Y"^ have good similar properties (they 
are zero-dimensional polish spaces, see [PPOl] [Kec95] [Sim92], in fact they are 
homeomorphic) which enable to extend the previous definition to this new case. 
Indeed i? is a S^-complete subset of ((E U {A}) x (E U {A}))'^ because every 
infinitary rational relation is a S^-set. 

Then in that case R is not a Borel set because a S];-complcte set is not a Borel 
set. This gives infinitely many non Borel infinitary rational relations, because 
there exist infinitely many Il^-complete w-regular languages. 

It remains to show that if B is an w-regular language then R is an infinitary ratio- 
nal relation. In fact this is easy to see from the definition of R. We shall explicitely 

give a Blichi transducer defining R in the following simple case: E = {0, 1} and 
B = (0*.!)" is a well known example of Ila-complete w-regular language. 

The infinitary rational relation R is then recognized by the following Biichi trans- 
ducer r = {K, (E U {A}), (E U {A}), A, qo,F), where 

K = {qo,qi,q2,q3,q4,qi,ql,q2,Qi} 

is a finite set of states, {0,1,^} is the input and the output alphabet, qq is the 
initial state, and F = {ql,ql} is the set of accepting states. Moreover A C 
if X (E U {A})* X (E U {A})* X K is the finite set of transitions, containing the 
following transitions: 
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(go,0,A,gi) and (go,l,A,gi), 

{qi,u,v,qi), for all words u,?; e S* with \u\ = 1 and \v\ = 2, 
(ft, A, w, 92), for V G {0,1, A}, 
(g2,A0,g?) and {q2,A,l,ql), 

{q,u,v,qs), for all w, w e S* with |u| = 2 and \v\ = 1 and q € {91,(71,(73}, 
(g,u,A,g4), for MS {0,1, A} and g G {gf, 91,93}, 

(94, 0, A, q^) and (94, 1, A, q\), 

(9^, A, A, 91) and (9^, A, A, 91). □ 

Remark 4.2. We could of course have avoided the set of transitions to contain 
sonic transitions with both the input and the output words being empty, like the 
two last ones: (93, A, A, 91) and {q\, A, A, 91). 

Remark 4.3. We have shown that there exists some infinitary rational relations 
which are Sj-complete hence non Borel. In particular this implies that these 
infinitary rational relations are not arithmetical sets because every arithmetical 
set is a Borel set (of finite rank). We refer to [Sta86] [Sta97] for definitions and 
results about the arithmetical hierarchy over sets of infinite words over a finite 
alphabet S. 

Remark 4.4. From the preceding example we can easily find a Si-complete 
infinitary rational relation in the form where 5 is a rational relation over finite 
words, see [Ber79] [GirSl] [PriOO] about finitary rational relations. 

Acknowledgements. Thanks to Jean-Pierre Ressayre and Pierre Simonnet for 
useful discussions. 
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